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Abstract 

In this paper, we consider two important problems defined on finite metric spaces, and provide 
efficient new algorithms and approximation schemes for these problems on inpnts given as graph 
shortest path metrics or high-dimensional Euclidean metrics. The first of these problems is the 
greedy permutation (or farthest-first traversal) of a finite metric space: a permutation of the 
points of the space in which each point is as far as possible from all previous points. We describe 
randomized algorithms to find (1 -|-e)-approximate greedy permutations of any graph with n vertices 
and m edges in expected time 0[s~^{m + n) lognlog(n/e)), and to find (1 -|-e)-approximate greedy 
permutations of points in high-dimensional Euclidean spaces in expected time 

Additionally we describe a deterministic algorithm to find exact greedy permutations of any graph 
with n vertices and treewidth 0(1) in worst-case time 0(n^O n). The second of the two 

problems we consider is distance selection: given A; G [( 2 )!’ interested in computing the 

feth smallest distance in the given metric space. We show that for planar graph metrics one can 
approximate this distance, up to a constant factor, in near linear time. 

1. Introduction 

In this paper we are interested in several important algorithmic problems on finite metric spaces, including 
the construction of greedy permutations, the problem of selecting the fcth distance among all pairs of 
points in the space, and the problem of counting the number of points in a metric ball. These problems 
have known polynomial time algorithms (for instance, the fcth distance may be found by applying a 
selection algorithm to the coefficients of the distance matrix); however, we are interested in algorithms 
that scale well to large data sets, so we seek algorithms that take subquadratic time (substantially 
smaller than the time to list all distances). To achieve this, we require the metric space to be defined 
implicitly, for instance as the distances in a sparse weighted graph or as the distances among points in a 
Euclidean metric space. Despite the increased difficulty of working with implicit metrics, we show that 
the problems we study can be solved efficiently. 
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Figure 1: Left: The first five points of a greedy permutation. Each point is as far as possible from all 
previously chosen points. Right: The six red points at the centers of the disks form an r-net of the set V 
of red and white points, where r is the radius of the large yellow disks. The darker disks of radius r/2 
are disjoint from each other, and the disks of radius r together cover all of V. 


1.1. Greedy permutation 


In the hrst sections of this paper we are interested in an ordering problem on metric spaces: the 
construction of greedy permutations. We solve this problem exactly and approximately, for the shortest 
path metrics of sparse weighted graphs and for high-dimensional Euclidean spaces. 

A permutation 11 = (tti, 112 , ■ ■ ■} of the vertices of a metric space (V, d) is a greedy permutation 
(also called a farthest-first traversal or farthest point sampling) if each vertex vTj is the farthest in V from 
the set rii-i = {vTi,..., 7rj_i} of preceding vertices (Figure 1, left). Greedy permutations were introduced 
by Rosenkrantz et al. [RSL77] for the “farthest insertion” traveling salesman heuristic, and used by 
Gonzalez [Gon85] to 2-approximate the fc-center. Different prehxes of the greedy permutation provide 
different multi-resolution clusterings of the input point set; see Figure 2p3. 

Greedy permutations are closely related to another concept for hnite metric spaces, r-nets. An r-net 
for the metric space (V, d) and the numerical parameter r is a subset M of the points of V such that 
no two of the points of JV are within distance r of each other, and such that every point of V is within 
distance r of a point of A/”. Equivalently, the closed r/2-balls centered at the points of A/" are disjoint, 
and the closed r-balls around the same points cover all of V (Figure 1, right). Each prehx of a greedy 
permutation is an r-net, for r equal to the minimum distance between points in the prehx, and for every 
r an r-net may be obtained as a prehx of a greedy permutation^. 

Greedy permutations may be computed for metric spaces in O(n^) time, and for graphs in the same 
time as all pairs shortest paths, by a naive algorithm (Section 2) that maintains the distances of all points 
from the selected points. The only previous improvement on the naive algorithm, by Har-Peled and 
Mendel [HM06] dehnes a concept of approximation for greedy permutations that we will also use. They 
showed that (1 -I- e) -greedy permutations can be computed in O(nlogn) time in metric spaces with 
constant doubling dimension; these are permutations 11 = { 711 , 112 ,...) for which there exists a sequence 
of numbers ri > r 2 > ■ ■ ■ such that 


(A) the maximum distance of a point of V from Ilj is in the range r*, (1 + e)ri , and 

(B) the distance between every two points u,v EHi is at least r*. 

In this paper we give approximation schemes for metric spaces dehned by sparse graphs, and high¬ 
dimensional Euclidean spaces, neither of which have constant doubling dimension. Greedy permutations 
for graph distances were previously mentioned by Gu et al. [GJGll], in connection with an application 


^The notion of nets is closely related to congruent disc packing, which was studied in the end of the 19th century by 
Thue (see [PA95, Chapter 3]). It is however natural to assume that the concept is much older, as it is related to numerical 
integration and discrepancy [Mat99]. 
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Figure 2: (A) A point set. (B)-(G) The different representations of this point set as a union of balls, as 
provided by different prehxes of the greedy permutation. As this demonstrates, one can think about a 
prefix of the greedy permutation as a partial representation of the point set that keeps improving as the 
prehx used gets longer. 


in molecular dynamics, but rejected by them because the naive algorithm was too slow. 

One reason for interest in greedy or (1 + e)-greedy permutations is that, in a single structure, they 
approximate an optimal clustering for all possible resolutions. Specihcally, the prefix of the hrst k vertices 
in such a permutation, provides, for any fc, a 2(1 + ej-approximation to the optimal fc-center clustering 
of G. (See Lemma 2.1 for an easy proof of this.) The fc-center problem may be 2-approximated in 
0{kn) time by computing the hrst k vertices of a greedy permutation, and is NP-hard to approximate 
to a ratio better than 2 [Gon85].^ For points in Euclidean spaces of bounded dimension a linear-time 
2-approximation is known [Har04, HR13]. Thorup [ThoOS] provided a fast fc-center approximation for 
graph shortest path metrics with a single choice of k. The fc-center problem may be solved exactly on 
trees and cactus graphs in O(nlogn) time [FJ83]. Voevodski et al. [VBR+10] use an algorithm closely 
related to greedy permutation to approximate a different clustering problem, fc-medians. 

Intuitively, every prehx of a greedy permutation is as informative as possible about the whole 
set, so greedy permutations form a natural ordering in which to stream large data sets. Because 
of these properties, greedy permutations have many additional applications, including color quanti¬ 
zation [Xia97], progressive image sampling [ELPZ97], selecting landmarks of probabilistic roadmaps 
for motion planning [MAB98], point cloud simplihcation [MD03], halftone mask generation [SMR04], 
hierarchical clustering [DL05], detecting isometries between surface meshes [LF09], novelty detection 
and time management for autonomous robot exploration [GGD12], industrial fault detection [AYE12], 
and range queries seeking diverse sets of points in query regions [AAYI+13]. 

^Another 2-approximation for the fc-center by Hochbaum and Shmoys [HS85] is often erroneously credited as being the 
origin of the farthest-first traversal method, but actually uses a different algorithm. 
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1.2. Distance selection and approximate range counting 

The distance selection problem, in computational geometry, has as input a set of points in the output 
is the kth smallest distance defined by a pair of points of P. It is believed that such exact distance 
selection requires time in the worst case [Eri95] , even in the plane (in higher dimensions the bound 

deteriorates). Recently, Har-Peled and Raichel [HR13] provided an algorithm that (1 + e:)-approximates 
this distance in 0 {n/e^) time. 

We are interested in solving the problem for the finite metric case. Specifically, consider a shortest 
path metric defined over a graph G with n vertices and m edges. Given k G [[(2)] = (l; • • • > (2) }’ 
would like to compute the fcth smallest distance in this shortest path metric. This problem was studied 
for trees [MTZC81], where Frederickson and Johnson [FJ83] provided a beautiful algorithm that works 
by using tree separators, and selection in sorted matrices [FJ84]. 

The “dual” problem to distance selection is distance counting. Here, given a distance r, the task is to 
count the number of pairs of points of P that are of distance < r. While the problems are essentially 
equivalent in the exact case, approximate distance counting seems to be significantly easier than selection. 

Throughout this paper, G = (R, E) will denote an undirected graph with n vertices and m edges, 
with non-negative edge weights that obey the triangle inequality. The shortest path distances in G 
induce a metric d. Specifically, for any u,v eV, let dciu^v) denotes the shortest path between u and v 
in G. Given a graph G = {V,E), and a query distance r, the set of r-short pairs is 

u ^ V and dG{u,v)<rY (1) 

In the distance counting problem, the task is to compute (or approximate) \P<r\. In the distance 
selection problem, given k E [[(2)] 5 the task is to compute (or approximate) the smallest r such that 
\P<r\ > k. 

Distance counting is easy to approximate using known techniques, since this problem is malleable 
to random sampling, see [Gohl4] and references therein. However, approximate distance selection is 
significantly harder, as random sampling can not be used in this case - indeed, trying to use approximate 
distance counting (or sketches approach as in [Gohl4]), may result in an arbitrarily bad approximation to 
the fcth distance, if the {k — l)th and [k + l)th distance are significantly smaller and larger, respectively, 
than the /cth distance (or similar sparse scenarios). 

1.3. New results 

Greedy permutation for sparse graphs. In Section 2, we show that an (1 4-£)-greedy permutation 
can be found for graphs with n vertices and m edges in time 0 {e~^m\ogn\og{n/e)) = 0{m)^. 

Approximate greedy permutation for high dimensional Euclidean space. In Section 3, we 
show that an approximate greedy permutation can be computed for a set of points in high-dimensional 
Fuclidean space. The algorithm runs in subquadratic time and has polynomial dependency on the 
dimension. Our approximations are based on finding r-nets (or in the Euclidean case approximate r-nets) 
for a geometric sequence of values of r 

In an earlier paper [HIS13], the authors showed that for high dimensional point sets one can get 
a sparse spanner. Applying the above algorithm for sparse graphs to this spanner yields a greedy 
permutation, but with significantly weaker bounds, as the stretch in the constructed spanner is at lease 2. 

^The O notation hides logarithmic factors in n and polynomial terms in 1/e. We assume throughout the paper that 
n = 0{m). 


P<r = \{U,V} CV 
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Exact greedy permutation for bounded tree-width. In Section 4, we show how to hnd an exact 
greedy permutation for graphs of bounded treewidth, in time by partitioning the input graph 

into small subgraphs separated from the rest of the graph by 0(1) vertices, and by using an orthogonal 
range searching data structure in each subgraph to hnd the farthest vertex from the already-selected 
vertices. 

Distance selection in planar graphs. In Section 5, we show how to 0(l)-approximate the fcth 
distance in a planar graph G. Specihcally, given k, the algorithm computes in near linear time, a number 
a, such that the fcth shortest distance in G is at least a, and at most 0{a). This algorithm uses a planar 
separator and distance oracles in an interesting way to count distances. 

2. Approximate greedy permutation on a sparse graph 

We are interested in approximating the greedy permutation for a graph G. Among other motivations, 
this provides a good approximation for /c-center clustering: 

Lemma 2.1. If li is a (1 + e)- greedy permutation of M. = {V,d), then, for all k, IIa; provides a 
2(1 -|- e) -approximation to the optimal k-center clustering and minimax diameter k-clustering of M.. 

Proof: By Property (A) of such a permutation (see Section 1.1), all points of V can be covered by balls of 
radius (1 4- e)rk centered at tti, ..., iTk] these balls have diameter < 2(1 4- e)rk- Let S' = Ilfc U {u}, where 
V is the farthest point in V from 11^. By the dehnition of rj and by Property (B) of these permutations, 
every two points in S have distance at least r^, so no k clusters of radius smaller than rj/2 or diameter 
smaller than r* can cover the k + 1 points in S. 

A naive algorithm for computing the greedy permutation maintains for each vertex v its distance 
to the set of centers picked so far, and uses these distances as priorities in a max-heap, which it uses 
to select each successive center, using Dijkstra’s algorithm to update the distances after each center is 
picked. There are n instantiations of Dijkstra’s algorithm, taking time 0{n{m 4- ulogn)). 

To improve performance, we may avoid adding a vertex v to the min-heap used within Dijkstra’s 
algorithm unless its tentative distance is smaller than iy, preventing the expansion of vertices for which 
the distance from Vi is no smaller than iy. This idea does not immediately improve the worst-case 
running time of the algorithm but will be important in our approximation algorithm. 

2.1. Computing an r-net in a sparse graph 

We compute an r-net in a sparse graph using a variant of Dijkstra’s algorithm with the sequence of 
starting vertices chosen in a random permutation. A similar idea was used by Mendel and Schwob 
[MS09] for a different problem; however, using this method for our problem involves a more complicated 
analysis. 

Let G = {y, E) be a weighted graph with n vertices and m edges, let r > 0, and let vTj be the ith 
vertex in a random permutation of V. For each vertex v we initialize S{v) to 4-C)0. In the Ah iteration, 
we test whether 6{7ii) > r, and if so we do the following steps: 

1. Add TTj to the resulting net Af. 

2. Set 6{7ii) to zero. 

3. Perform Dijkstra’s algorithm starting from vTj, modified as in Section 2to avoid adding a vertex u 
to the priority queue unless its tentative distance is smaller than the current value of 6{u). When 
such a vertex u is expanded, we set 6{u) to be its computed distance from tt*, and relax the edges 
adjacent to u in the graph. 
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The difference from the algorithm of Mendel and Schwob is that their algorithm initiates an instance of 
Dijkstra’s algorithm starting from every vertex TTj, whereas we do so only when S(7ri) > r. 

Lemma 2.2. The set M is an r-net in G. 

Proof: By the end of the algorithm, each v G V has S{v) < r, for S{v) is monotonically decreasing, and if 
it were larger than r when v was visited then v would have been added to the net. 

An induction shows that if £ = (5(u), for some vertex v, then the distance of v to the set TV is at most 
i. Indeed, for the sake of contradiction, let j be the (end of) the hrst iteration where this claim is false. 
It must be that tij G TV, and it is the nearest vertex in TV to v. But then, consider the shortest path 
between Tij and v. The modihed Dijkstra must have visited all the vertices on this path, thus computing 
6{v) correctly at this iteration, which is a contradiction. 

Finally, observe that every two points in TV have distance > r. Indeed, when the algorithm handles 
vertex v G TV, its distance from all the vertices currently in TV is > r, implying the claim. 

Lemma 2.3. Consider an execution of the algorithm, and any vertex u G V. The expected number of 
times the algorithm updates the value of 6{v) during its execution is 0(\ogn), and more strongly the 
number of updates is O(logn) with high probability. 

Proof: For simplicity of exposition, assume all distances in G are distinct. Let Si be the set of all the 
vertices x G V, such that the following two properties both hold: 

(A) d{x, v) < d{v, flj), where flj = {tti, .. ., TTj}. 

(B) If TTi+i = x then 6 {v) would change in the {i + l)th iteration. 

Let Si = |S'j|. Observe that Si ^ S 2 ^ ^ Sn, and = 0. 

In particular, let £j+i be the event that S{v) changed in iteration (i + 1) ~ we will refer to such an 
iteration as being active. If iteration {i + 1) is active then one of the points of Si is TTj+i. However, vrj+i 
has a uniform distribution over the vertices of S',, and in particular, if £j_|_i happens then Sj+i < Sj/2, 
with probability at least half, and we will refer to such an iteration as being lucky. (It is possible that 
Sj+i < Si even if £*+1 does not happen, but this is only to our beneht.) After O(logn) lucky iterations 
the set Si is empty, and we are done. Clearly, if both the Ah and jth iteration are active, the events 
that they are each lucky are independent of each other. By the Chernoff inequality, after clogu active 
iterations, at least [log 2 n] iterations were lucky with high probability, implying the claim. Here c is a 
sufficiently large constant. 

Interestingly, in the above proof, all we used was the monotonicity of the sets Si,... ,Sn, and the 
fact that if 5(u) changes in an iteration then the size of the set Si shrinks by a constant factor with good 
probability in this iteration. This implies that there is some flexibility in deciding whether or not to 
initiate Dijkstra’s algorithm from each vertex of the permutation, without damaging the number of times 
of the values of 6{v) are updated. We will use this flexibility later on. 

Lemma 2.4. Given a graph G = {V,E), with n vertices and m edges, the above algorithm computes an 
r-net of G in 0((n + m) logn) expected time. 

Proof: By Lemma 2.3, the two 6 values associated with the endpoints of an edge get updated O(logu) 
times, in expectation, during the algorithm’s execution. As such, a single edge creates 0(log?7,) decrease- 
key operations in the heap maintained by the algorithm. Each such operation takes constant time if we 
use Fibonacci heaps to implement the algorithm. 


6 


2.2. An approximation whose time depends on the spread 

Given a finite metric space (V, d) defined over a set V, its spread is the ratio between the maximnm 
and minimnm distance in the metric; formally, 

spread((i) = max dciu^v)/ min dciu^v). 

u,vGV,u^v u^vGVyU^v 

Let graph G = (V, E) and £ > 0 be given. Assnme for now that the minimum edge length is 1, and 
that the diameter of G is at most A. Set r* = A/(l + ey~^, for f = 1,..., M = ["logi^.^ A]. We compute 
a sequence of nets in a sequence of iterations. In the hrst iteration, compute an ri-net A/i of G, using 
Lemma 2.4. In the beginning of the ith iteration, for i > 1, let Si = Uj<jA/'j. Using Dijkstra, mark 
as used all vertices within distance r* of S^. Compute an rj-net A/i in G, modifying the algorithm of 
Lemma 2.4 by disallowing used vertices from being considered as net points. 

After completing these computations, combine the vertices into a single permutation in which the 
vertices of A/i form the zth contiguous block. Within this block, the ordering of the vertices of A/i is 
arbitrary. The following is easy to verify, and we omit the easy proof. 

Lemma 2.5. Let M = Mi U ■ ■ ■ U A/i, for an arbitrary t. Then the distance of every vertex n G G from 
M is at most rt, and the distance between any pair of vertices of the net is at least rt- 

That is, the net computed by the tth iteration is a “perfect” net. In between such blocks, it might 
be less then perfect. Formally, we have the following (again, we omit the relatively easy proof). 

Lemma 2.6. Let tt be the permutation computed by the above algorithm, and consider the ith vertex iii 
in this permutation. Assume that vTj G A/). Then we have the following guarantees: 

(A) The distance between any two vertices of Hi = {tti, ... ,7rj} is at least rt- 

(B) The distance of any vertex of v E V from Ilj, is at most rt-i = (1 + £)rt. 

Lemma 2.7. Let G = (U, E) be a graph, let e > 0, and let $ be the spread of G. Then, one can compute 
a (1 + e)-greedy permutation in 0{e~^{n + m) log n log <F) expected time. 

Proof: A 2-approximation to the diameter of G can be found by running Dijkstra’s algorithm from an 
arbitrary starting vertex. The minimum distance in G is achieved by an edge (since the edge weights 
are positive), so it can be computed in linear time. After scaling, we can use the above algorithm, with 
M = 0(log]^_,_j, $) = 0{e~^ log<h) iterations, each using a modihed version of the algorithm of Section 2.1. 
It is straightforward to modify the analysis to show that the each such iteration takes 0{{n m) logn) 
time in expectation. 


2.3. Eliminating the dependence on the spread 

An arbitrary graph G may not have small enough spread to apply the previous algorithm directly. In 
this case, following by-now standard methods for eliminating the dependence on spread (see Section 4 
of [MS09]), we simulate the algorithm more efficiently, using a value of £ smaller by a constant factor to 
make up for some additional approximation in our simulation. 

Consider an iteration of the above algorithm for distance r*. Edges longer than nvi can be ignored or 
(conceptually) deleted, as they cannot be used by the r-net algorithm of Section 2.1. Similarly, edges of 
length 0{eri/M) can be collapsed and treated as having length zero. Thus, an edge e of length ^ is active 

when Tj is in the interval , which happens for O (\ogij^^{M/e)) = 0(e:“^ log(n/e:)) iterations. Let 

mi be the number of active edges in the zth iteration, and let G* be the resulting graph, in which all 
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the edges of lengths < eri/'n? are contracted (the resnlting snper vertex is identihed with one of the 
original vertices), and all the edges of length > Tin are removed. Any singleton vertex in this graph is 
not relevant for compnting the permutation in this resolution, and it can be ignored. The running time 
of Lemma 2.4 on Gi is 0(mj logruj). 

When the algorithm moves to the next iteration, it needs to introduce into Gi all the new edges that 
become active. Using a careful implementation, this can be done in 0(1) amortized time, for any newly 
introduced edge. Similarly, edges that become inactive should be deleted. Of course, if there are no 
active edges, the algorithm can skip directly to the next resolution. This can be easily done, by putting 
the edges into a heap, sorted by their length, and adding the edges and removing them as the algorithm 
progresses down the resolutions. 

The overall expected running time of this algorithm is 0(^j logruj + mlogm). However, since ev¬ 
ery edge is active in 0 {e~^ login/e)) iterations, we get that the expected running time is 0 {e~^m lognlog(n/e)). 
We thus get the following. 

Theorem 2.8. Given a non-negatively weighted graph G = {V,E), with n vertices and m edges, and 
a parameter e > 0, one can compute a (1 -|- s)-approximate greedy permutation for G in expected time 
0 {e~^mlognlog{n/e)). 

2.4. /c-center clustering for bounded spread with integer weights 

Our greedy permutation algorithm for sparse graphs leads to a fast (2 + e)-approximation to the /c-center 
problem for graph metrics. In this case, it is possible to eliminate the dependence on e, giving a 
2-approximation (best possible as achieving a smaller approximation ratio is NP-hard). 

Theorem 2.9. Let G be a graph with n vertices and m edges, with positive integer weights on the edges, 
and spread d*. Given k, one can compute a 2-approximation to the optimal k-center clustering of G, in 
0(mlognlog<I)) time both in expectation and with high probability. 

Proof: Using Dijkstra’s algorithm starting from an arbitrary vertex in G, compute, in 0{m nlogn) 
time, a number A, such that diam(G) < A < 2diam(G). We next perform a binary search for the radius 
Topt of the optimal fc-center clustering of G, in the range 1,..., A. 

Given a candidate radius x, let r = 2x and compute an r-net of G using the algorithm of Lemma 2.4. 

It is easy to verify that if x > ropt, then the resulting net ff has at most k vertices in it. Indeed, consider 
all the vertices in G assigned to a single cluster G in the optimal fc-center clustering, and observe that 
diam(C') < 2ropt < r. Therefore, at most one vertex of G may belong to any r-net, so every r-net for 
this value of r has at most k vertices. On the other hand, if x is too small then the resulting r-net has 
more then k vertices. 

Thus, using the r-net procedure of Lemma 2.4 as a decider in a binary search yields the desired 
approximation algorithm. 

3. Approximate greedy permutation on Euclidean metrics 

3.1. Approximate nets 

Lemma 3.1 (Johnson—Lindenstrauss lemma [JL84]). For any e > 0, every set of n points in 
Euclidean space admits an embedding into || . jj^)^ yjHh distortion l-\-e. 

Definition 3.2. Let iP be a family of hash functions mapping to some universe U. We say that H is 
{6, c6,pi,P 2)-sensitive if for any x,y it satishes the following properties: 


(A) If llx - y\\2 < 5 then FThenihix) = h{y)] > pi. 

(B) If ||x - y \\2 > c6 then PYheH[h{x) = h{y)] < p 2 . 

Lemma 3.3 (Andoni &; Indyk [AI06]). For any S > 0, dimension d > 0, and c> 1, there exists a 
c5, -sensitive family of hash functions for where every function can he evaluated 

in time 0{d ■ . 

We extend the notion of nets to c-approximate r-nets, for c > 1, r > 0, and metric space {V,d). 
These are subsets W of the points of V such that no two of the points of f/ are within distance r of each 

other, and such that every point of V is within distance c • r of a point of f/. 

Theorem 3.4. Let d > 0, r > 0, e > 0. Given a set X of n points in one can compute in expected 
running time O ^e“ 2 ^i+i/(i+£) 2 +o(i)^ ^ Af F X such that Af is a {1 e)-approximate r-net for the 
Euclidean metric on X with high probability. 

Proof: By Lemma 3.1 we may assume that d = 0{\ogn/e‘^), since otherwise we can embed X into 
Euclidean space of dimension 0{\ogn/e‘^), with distortion 1 + 0(e). Any (1 + 0(e))-approximate r-net 
for this new point set is a (1 -|- 0(e))-approximate r-net for X. 

Let c = 1 -|- e. Let H be the (r, cr,pi,p 2 )-sensitive family of hash functions given by Lemma 3.3, 

with Pi = 1 /n^G^+op)^ = 1/n. We sample k = 0((l/pi) -logn) = hash functions 

hi,... ,hk E H. For every f G {1, ..., k}, and for every x E X, we evaluate hi{x). 

We construct a set Af F X which is initially empty, and it will be the desired net at the end of the 

algorithm. Initially, we consider all points in X as being unmarked. We pick an arbitrary ordering 
Xi,..., of the points in X, and we iterate over all points in this order. When the iteration reaches point 
Xi, if it is already marked, we skip it, and continue with the next point. Otherwise, we add Xi to Af, and 
we proceed as follows. Let Mj be the set of all currently marked points. Let Si = ljj=i h~^{xi)\Mi be the 
set of unmarked points that are hashed to the same value in at least one of the sampled hash functions 
hi, ... ,hk. We mark all points y E Si, such that || x* — |/|| 2 < c-r. This completes the construction of 
the desired set Af. 

We next argue that Af is indeed a c-approximate r-net. Every point y E X \ Af must have been 
marked when considering some earlier point Xj G Af, implying that ||xj — |/||2 < c-r. Thus, every non-net 
point is covered by a net point. On the other hand, consider a pair of points Xi,Xj {i < j) for which 
< r. Then with high probability, there exists G {1,... ,k}, with htfxi) = htfxj). If so, Xj 
will be marked when we consider Xj, preventing it from belonging to Af. Therefore, with high probability, 
we have that for any x, x' E Af, ||x — x '||2 > r. This establishes that Af is indeed a c-approximate r-net. 
It remains to bound the running time. We perform n-k hash function evaluations in time 0{d ■ = 

each, totaling time The remaining time is dominated by 0(X]r=i 

Each point is marked at most once, so \Si\) = 0{n -\- L), where L is the total number of false 

positives, i.e. the number of triples x, y, t with x,y E X, \\x — y\\2>c-r,tE{l,...,k}, and ht{x) = htiy). 
For any x,y E X, with ||x — y\\2 > c ■ r, and for any t E {1,..., k}, we have Pr[ht(x) = ht{y)] < 1/n. 
Since there are 0{n'^) pairs of points, we conclude that the expected number of false positives is 

E[L] < ZlLi 1/’^) ^ We conclude that the total expected running time of 

the algorithm is as required. 
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3.2. An approximation whose time depends on the spread 

Lemma 3.5. Let d > 1, let X be a set of n points in let e > 0, and let $ be the spread of the 
Euclidean metric on X. Then, one can compute in O ^e“ 2 ^i+i/(i+£) 2 +o(i) iQg<j)^ expected time a sequence 
that is a (1 + e)-greedy permutation for the Euclidean metric on X, with high probability. 

Proof: We use the algorithm from Lemma 2.7. A 2-approximate diameter can easily be computed in 
linear time, by choosing one point arbitrarily and hnding a second point as far from it as possible. The 
only new needed observation is that it is sufficient for the algorithm to compute (1 + 0(£))-approximate 
r-nets, using Theorem 3.4, in place of r-nets. As in Lemma 2.7, the approximate r-net algorithm needs to 
be modihed to mark near-neighbors of previously selected points as used, so that they are not selected as 
part of the net; this step does not increase the total running time for the approximate r-net construction. 
The rest of the analysis remains the same. 

3.3. Approximating all-pairs min-max paths 

As a tool for eliminating the dependence on the spread in our approximate greedy permutation algorithm, 
we will use an approximation to the minimum spanning tree. However, we do not wish to approximate 
the total edge length of the tree, as has been claimed by Andoni and Indyk [AI06]; rather, we wish to 
approximate a different property of the minimum spanning tree, the fact that for every two vertices it 
provides a path that minimizes the maximum edge length among all paths connecting the same two 
vertices. 

Lemma 3.6 ([AI06]). Given n points in a Euclidean space of dimension d, and given a parameter 
c> 1, we may preprocess the points in time and space o(^dn + q data structure that 

may be used to answer c-approximate nearest neighbor queries in query time with high 

probability of correctness. 

Lemma 3.7. Given n points in a Euclidean space of dimension d, and given a parameter e > 0, we may 
in expected time O j find a spanning tree T of the points such that, for every two points u 

and v, the maximum edge length of the path in T from u to v is at most (1 -|- e) times the maximum edge 
length of the path in the minimum spanning tree from u to v. 

Proof: As before, we may assume without loss of generality that d = 0{\ogn/e‘^). We build T by an 
approximate version of Boruvka’s algorithm, in which we maintain a forest (initially having a separate 
one-node tree per point) and then in a sequence of O(logn) stages add to the forest the edge connecting 
each tree with its (approximate) nearest neighbor. 

In each stage, we assign each tree of the forest an 0(log?7,)-bit identiher. For each pair [i, b) where i is 
one of the 0(log?7,) bit positions of these identihers and b is zero or one, we build a (1 -1- £)-approximate 
nearest neighbor data structure for the points whose tree has b in the Ah bit of its identiher, for a total of 
O(logn) structures. Then, for each point p of the input set, we use these data structures to hnd O(logn) 
candidate neighbors of p, one for each of the O(logn) structures that do not contain p. This gives us a 
set of 0(n log n) candidate edges, among which we select for each tree of the current forest the shortest 
edge that has one endpoint in that tree. As in Boruvka’s algorithm, with an appropriate tie-breaking 
rule, adding the selected edges to the forest does not produce any cycles, and reduces the number of 
trees in the forest by at least a factor of two, so after O(logn) stages we will have a single tree, which we 
return as our result. 
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To show that this tree has the desired approximation property, consider a complete graph Kn on 
the input points, in which the weight of an edge that does not belong to the output tree is the distance 
between the points. However, in this graph, we set the weight of an edge e that does belong to the tree 
by letting Ti and T 2 be the two trees containing the endpoints of e in the last stage of the algorithm 
for which these endpoints belonged to different trees, and setting the weight of e to be the minimum 
distance between a pair of vertices one of which belongs to Ti or T 2 and the other of which does not 
belong to the same tree. Then, by the correctness of the approximate nearest neighbor data structure, 
the weight of e is at most equal to the distance between its endpoints and at least equal to that distance 
divided by 1 + e. However (with an appropriate tie-breaking rule) the algorithm we followed to construct 
our tree T is exactly the usual version of Boruvka’s algorithm as applied to the weighted graph Kn- 
Therefore, for every u and u, the path from u to u in T exactly minimizes the maximum edge length 
among all paths in and thus has the desired approximation for the original distances. 

3.4. Eliminating the dependence on the spread 

As in Section 2.3, we will eliminate the log$ term in the running time for our approximate greedy 
permutation algorithm by, in effect, contracting and uncontracting edges of a graph, the approximate 
minimum spanning tree of Section 3.3. 

Theorem 3.8. Let d>l, let X be a set of n points in and let e > 0, Then, one can compute in 

expected time a sequence that zs a (1 -|- e)-greedy permutation for the Euclidean 
metric on X, with high probability. 

Proof: We maintain a partition of the input into subproblems, dehned by subtrees of the spanning tree 
T computed by Lemma 3.7. Initially, there is one subproblem, dehned by the whole tree, but it does not 
include all the input points. Rather, as the algorithm progresses, certain points within each subproblem’s 
subtree will be active, depending on the current value r for which we are computing approximate r-nets. 

We delete edge e from tree T, splitting its subproblem into two smaller sub-subproblems, whenever r 
becomes smaller than the length of e divided by 1 + 0(e). After this point, the points on one side of e are 
too far away to affect the choices made on the other side of e. We will include the endpoints of an edge e 
into the active points of the subproblem containing e whenever the current value of r becomes smaller 
than cn/e times the length of e (for an appropriate constant c < 1). Until that time there will always be 
another active point within distance cer of e, so omitting the endpoints of e will not signihcantly affect 
the approximation quality of the greedy permutation we construct. 

At each stage of the algorithm, when we construct approximate r-nets for some particular value 
of r, we do so separately in each subproblem that has two or more active points. (In a subproblem 
with only one active point, that point must have been included in the approximate greedy permutation 
already, and so cannot be chosen for the new r-net. However, the points that have been included in the 
permutation must remain active in order to prevent other points within distance r of them from being 
added to the net.) Each edge e contributes to the size of a subproblem only for a logarithmic number of 
different values of r, so the total time is as stated. 

4. Exact greedy permutation for bounded treewidth 

For restricted classes of graphs, we can compute a greedy permutation exactly, more quickly than the naive 
algorithm of Section 2. Our algorithms follow Cabello and Knauer [CK09] in applying orthogonal range 
searching data structures to the vectors of distances from small sets of separator vertices. Specihcally, 
we need the following result. 
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Lemma 4.1 ([GBT84]). Let S be a set of n points in for a constant k > 1. Then we may process 
S in time and space 0(n\og^~^ n) so that the loo-nearest neighbor in S of any query point q may he 
found in query time 0(log^~^ n). 

4.1. Tree decomposition and restricted partitions 

A tree decomposition of a graph G is a tree V whose nodes are associated with sets of vertices of G 
called bags, satisfying the following two properties: 

(A) For each vertex v of G, the bags containing v indnce a connected snbtree of V. 

(B) For each edge uv of G, there exists a bag in V containing both u and v. 

The width of a tree decomposition is one less than the maximnm size of a bag, and the treewidth of a 
graph is the minimnm width of any of its tree decompositions. 

Following Frederickson [Fre97], who defined a similar concept on trees, we define a restricted order- 
k partition of a graph G of treewidth tc to be a partition of the edges of G into 0{n/k) snbgraphs Si 
such that, for all i, the subgraph Si has at most k edges, and such that, for each Si, at most 2 w + 2 
vertices are endpoints of edges both in and outside Si. 

Lemma 4.2. Fix w = 0(1). Then for every n-vertex graph G of treewidth < w and every k > (^), a 
restricted order-k partition of G may he constructed in time 0{n) from a tree decomposition of G. 

Proof: Let "D be a tree decomposition of P, of width w. Without loss of generality (by choosing a root 
arbitrarily and splitting some nodes of P into multiple nodes having equal bags) we may assume that P 
is a rooted binary tree. Associate each edge of G with a unique node of P having the two endpoints of 
the edge in its bag, choosing arbitrarily if there is more than one node. 

Following Frederickson [Fre97], hnd a partition of the nodes of P into subsets Di, having the following 
properties: 

(A) Each subset Di induces a connected subtree of P. 

(B) Each subset Di is associated with at most k edges of G. 

(C) For each i, if Di contains more than one node, then at most two edges of P have one endpoint 
in Di and the other endpoint in a different subset. 

(D) No two adjacent subsets Di can be merged while preserving properties (A), (B), and (C). 

This partition may be found in linear time by a greedy bottom-up traversal of P. For each subset Di, we 
form a subgraph Si of G consisting of the edges associated with nodes in Di. Property (B) implies that 
each subgraph Si has at most k edges, and property (C) implies that there are at most 2 w + 2 vertices 
that are endpoints of edges both in Si and in other subgraphs (namely the vertices in the bags of the 
two nodes of Di that are endpoints of edges connecting Di to other sets). 

It remains to show that there are 0{n/k) subsets. Contracting each subset Di in "D to a single node 
produces a binary tree T in which each edge with one degree-one endpoint or two degree-two endpoints 
connects subgraphs that together have more than k edges of G, for otherwise these two subgraphs would 
have been merged. It follows that T has 0{n/k) such edges, and therefore that it has only 0{n/k) nodes. 
Thus, we have formed 0{n/k) subsets Si, as desired. 

Given a restricted partition of G, we dehne an interior vertex of a subgraph Si to be a vertex of G 
all of whose incident edges belong to Si, and we dehne a boundary vertex of Si to be a vertex that is 
incident to an edge in Si but is not an interior vertex of Si. 
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4.2. The algorithm 

Suppose graph G has treewidth w = 0(1). We may hnd a greedy permutation of G by performing the 
following steps. 

1. Find a tree-decomposition of G of width w [Bod96]. 

2. Apply Lemma 4.2 to hnd a restricted oidei-y/n partition of G. 

3. Construct a weighted graph H whose vertices are the boundary vertices of the restricted partition, 
where two vertices u and v are connected by an edge if they belong to the same subgraph Si of the 
partition, and where the weight of this edge is the length of the shortest path in Si from u to n (or 
a sufficiently large dummy value Z, greater than n times the heaviest edge of G, if no such path 
exists). 

4. For each vertex v of H, initialize a value d{v) to Z. As the algorithm progresses, d{v) will represent 
the length of the shortest path to v from a vertex already belonging to the greedy permutation. 

5. For each subgraph Si of the restricted partition, having k boundary vertices, construct a {k + 1)- 
dimensional ^oo-nearest neighbor data structure (Lemma 4.1) whose points correspond to interior 
vertices of Si. The hrst coordinate of the point for v is the length of the shortest path within Si to 
V from an interior vertex of Si that belongs to the greedy permutation; initially, and until such a 
path exists, it is Z. The remaining coordinates give the lengths of the shortest paths in Si from 
each boundary vertex to n, or Z if no such path exists. 

6. Repeat n times: 

(a) For each subgraph Si of the restricted partition, hnd the farthest interior vertex of Si from 
the already-selected vertices, and its distance from the vertices selected so far. This may be 
done by querying the nearest neighbor of a point q whose hrst coordinate is 2Z and whose ith 
coordinate is 2Z — d{vi), where Vi is the ith boundary vertex of Si. 

(b) Among the 0{^/n) vertices found in the previous step, and the 0(^/n) boundary vertices whose 
distance d(v) from the greedy permutation was already known, select a vertex v whose distance 
from the greedy permutation is maximum, and add it to the permutation. 

(c) If V is an interior vertex of a subgraph Si of the restricted partition, use Dijkstra’s algorithm 
to compute the shortest path within Si from it to the other interior vertices, and rebuild the 
nearest neighbor data structure associated with Si after updating the hrst coordinates of each 
of its points. 

(d) Use Dijkstra’s algorithm on II (if n is a boundary vertex) or on if -|- Si (if it is interior to Si), 
starting from v, to update the values d(u) for each boundary vertex u. 

The time analysis of this algorithm is straightforward and gives us the following result. 

Theorem 4.3. Let G be an n-vertex graph of treewidth w = 0(1) with non-negative edge weights. Then 
in time O n) we may construct an exact greedy permutation for G. 

5. Counting distances in planar graphs 

In this section we give a near-linear time bicriterion approximation algorithm for counting pairs of vertices 
in a planar graph with a given pairwise distance r > 0. The result is approximate in the following sense. 
If we let c and c' be the number of pairs with distance at most r and at most (3 -|- e)r respectively, for 
some e > 0, then we output a number a G [c, c']. 

The following is due to Thorup [Tho04] (see also [KST13]). 

Theorem 5.1 (Thorup [Tho04]). For any n-vertex undirected planar graph with non-negative edge 
lengths, and for any e > 0, there exists a (1 + e)-approximate distance oracle with query time 
space 0{e~^n\ogn), and preprocessing time 0{e~’^n\o^ n). 
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The basic idea is now to recursively decompose G using planar separators. Fortunately, one can do it 
in such a way, that when looking on a patch P, with m vertices, formed by this recursive decomposition, 
the distances between the boundary vertices of P (in the original graph) are known. The details of 
how to compute this decomposition is described by Fakcharoenphol and Rao [FR06], and we recall their 
result. 

Let G = (R, E) be a graph. A patch of a graph is a subset C C R, such that the induced subgraph 
G[G] is connected. A vertex u G C is a boundary vertex if there exists a vertex u E V \ G such that 
uv E E. A hierarchical decomposition P of G is a set of subsets of the vertices of G, that can be 
described via a binary tree T, having patches of G associated with each node of it. The root r of tree is 
associated with the whole graph; that is, G(r) = R. Every node of m G T has two children Ui,U2, such 
that G{vi) U G{v 2 ) = G(-u), and |G(ui)|, |G(u 2 )| < (2/3) |G(-u)|. A leaf of this tree is associated with a 
single vertex of G. Finally, we require that for any patch G in this decomposition, the set of its boundary 
vertices dG, has at most 0('^|G|) vertices. 

For every G E H, the (inner) distance graph of G, denoted by Vc is a clique over dG, with 
u,v E dG assigned length dc[c]{u,v), i.e. equal to the shortest distance of all paths between u and 
V, that are contained entirely inside G[G]. The dense distance graph associated with P is the graph 
G' = U«cI’o. 

Theorem 5.2 (Fakcharoenphol and Rao [FR06]). Let G he an n-vertex undirected planar graph 
with non-negative edge lengths. Then, one can compute, in 0(n log^n) time, a hierarchical decomposition 
P of G, and all the inner and outer distance graphs associated with its patches (i.e., Vq for all G eTL). 

We are now ready to prove the main result of this section. 

Theorem 5.3. Let G be a given n-vertex undirected planar graph with non-negative edge lengths, let 
r > 0, and e > 0. Let c = \P<r\ and d = |P<( 3 +e)r|; see Eq. (I)p 4 . Then, one can compute, in 
0 {e~‘^n\og^ n) time, an integer a, such that c < a < d. 

Proof: We compute a hierarchical decomposition of P, and for every G G P, the distance graph Pc, in 
total time 0{n\og^ n), using Theorem 5.2. We consider all patches G G P. If |G| = 1 then we let ac = 0. 
Otherwise, our purpose here is to count the number of pairs of vertices u,v E G, such that dc{u,v) < r, 
and u and v belong to different children of G. 

So, let Gi, G 2 be the two patches that are the children of G in P. Let Pi, B 2 be the set of border 
vertices of Gi, G 2 , respectively, and let P = Pi U P 2 . Let Gi = G[Gi] U T)c 2 i nnd G 2 = G[G 2 ] U T>Ci- 
That is, Gi is the union of the subgraph of G induced on the cluster Gi, and the distance graph inside 
G 2 , and the distance graph outside Gi U G 2 . Note that (i) R(Gi) = Gi, (ii) for any u,v E R(Gi), we 
have dci{u,v) = dciuyv), and (hi) |P(Gi)| = 0(|Gi|) + 0(|dG2|^) +0(|dG|^) = 0(|Gi|). Therefore, for 
any i E {1, 2}, by running Dijkstra’s algorithm on Gj starting from P, we can compute the set of vertices 

Ui = {v E Gi \ dciv, P) < r}, 

in time 0(|Gj| logn). Moreover, for any i G {1, 2}, for any v E Gi, we can compute (within the same time 
bounds) its closest vertex in P; specihcally, a vertex Fj(u) G P, with dG'(u,Fj(u)) = dG{v,B). Let B' be 
the set of all vertices that are border vertices in some ancestor of C in P. For any u E B, i E {1, 2}, let 

Ui{u) = {vEGi |R(u)=m}\P'. 

Using Theorem 5.1 we can hnd in time 0{e~^\B\'^) all pairs of vertices u,v E B, with dG{u,v) < (1 -\-e)r. 
That is, we compute the set of border vertex pairs 

T = {(m, u) G P X P I dciu, u) < (1 + £)r} . 
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We now explicitly count all the pairs of vertices that are in distance at most r from a pair of vertices on 
the boundary, such that these boundary vertices in turn are in distance at most (1 + e)r from each other. 
That is, we set 


Finally, we return the value a = Every ordered pair (n,n) G V{G) x V{G) is counted exactly 

once in the above summation. Moreover, every pair with dciu^v) < r contributes 1, while every pair 
with dc{u,v) > (1 + s)3r contributes 0, implying that c < a < c', as required. 

It remains to bound the running time. Constructing T-L takes time O(nlog^n). The hierarchical 
decomposition has (9(logn) levels. For every level, we spend a total of 0{n\ogn) time running Dijkstra’s 
algorithm. We also spend a total time of 0{e~^n) computing the sets T. Finally, we spend 0{e~‘^n\og^ n) 
time at the preprocessing step of the distance oracle from Theorem 5.1. Therefore, the total running 
time is 0 {e~^n\og^ n), which concludes the proof. 

6. Conclusions 

We have found efficient approximation algorithms for greedy permutations in graphs and in high¬ 
dimensional Euclidean spaces, and fast exact algorithms for graphs of bounded treewidth. This implies 
(2 -|- e)-approximate fc-center clustering of graph metrics in (^^(mlog^n) time (ignoring the dependency 
on e), for all values of k simultaneously. Eor a single value of fc, and for graphs whose weights are 
positive integers, our technique can be used to construct a 2-approximation to the fc-center clustering 
in 0(mlognlogspread(G)) expected time (Theorem 2.9). This compares favorably with a signihcantly 
more complicated algorithm of Thorup [Tho05] that has running time worse by at least a factor of 
0(log^ n). 

We leave open for future research Ending other graphs in which we may construct exact greedy 
permutations more quickly than the naive algorithm. Another direction for research concerns hyperbolic 
spaces of bounded dimension. Krauthgamer and Lee [KL06] claim without details that for all £ > 0, sets 
of n points in hyperbolic spaces of bounded dimension have (1 -1- e)-Steiner spanners with 0{n) vertices 
and edges. Applying our graph algorithm to these spanners (modihed to avoid including Steiner points 
in its r-nets) would give a near-linear time approximate greedy permutation for those spaces as well, but 
perhaps a more direct and more efficient algorithm is possible. 
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